Towards fast and adaptive optimal control policies for robots: A direct policy search approach
نویسندگان
چکیده
Optimal control methods are generally too expensive to be applied on-line and in real-time to the control of robots. An alternative method consists in tuning a parametrized reactive controller so that it converges to optimal behavior. In this paper we present such a method based on the “direct Policy Search” paradigm to get a cost-efficient control policy for a simulated two degrees-of-freedom planar arm actuated by six muscles. We learn a parametric controller from demonstration using a few near-optimal trajectories. Then we tune the parameters of this controller using two versions of a Cross-Entropy Policy Search method that we compare. Finally, we show that the resulting controller is 20000 times faster than an optimal control method producing the same trajectories.
منابع مشابه
Direct adaptive fuzzy control of flexible-joint robots including actuator dynamics using particle swarm optimization
In this paper a novel direct adaptive fuzzy system is proposed to control flexible-joints robot including actuator dynamics. The design includes two interior loops: the inner loop controls the motor position using proposed approach while the outer loop controls the joint angle of the robot using a PID control law. One novelty of this paper is the use of a PSO algorithm for optimizing the contro...
متن کاملFunction Approximation Approach for Robust Adaptive Control of Flexible joint Robots
This paper is concerned with the problem of designing a robust adaptive controller for flexible joint robots (FJR). Under the assumption of weak joint elasticity, FJR is firstly modeled and converted into singular perturbation form. The control law consists of a FAT-based adaptive control strategy and a simple correction term. The first term of the controller is used to stability of the slow dy...
متن کاملDirect Optimal Motion Planning for Omni-directional Mobile Robots under Limitation on Velocity and Acceleration
This paper describes a low computational direct approach for optimal motion planning and obstacle avoidance of Omni-directional mobile robots within velocity and acceleration constraints on the robot motion. The main purpose of this problem is the minimization of a quadratic cost function while limitation on velocity and acceleration of robot is considered and collision with any obstacle in the...
متن کاملAdaptive Voltage-based Control of Direct-drive Robots Driven by Permanent Magnet Synchronous Motors
Tracking control of the direct-drive robot manipulators in high-speed is a challenging problem. The Coriolis and centrifugal torques become dominant in the high-speed motion control. The dynamical model of the robotic system including the robot manipulator and actuators is highly nonlinear, heavily coupled, uncertain and computationally extensive in non-companion form. In order to overcome thes...
متن کاملControl Theory and Economic Policy Optimization: The Origin, Achievements and the Fading Optimism from a Historical Standpoint
Economists were interested in economic stabilization policies as early as the 1930’s but the formal applications of stability theory from the classical control theory to economic analysis appeared in the early 1950’s when a number of control engineers actively collaborated with economists on economic stability and feedback mechanisms. The theory of optimal control resulting from the contributio...
متن کامل